Recombination Hotspot/Coldspot Identification Combining Three Different Pseudocomponents via an Ensemble Learning Approach

Authors

  • Bingquan Liu
  • Yumeng Liu
  • Dong Huang
Abstract

Recombination is nonuniformly distributed across the genome. Genomic regions with relatively high recombination frequencies are called hotspots, while those with relatively low frequencies are called coldspots. Identifying hotspots and coldspots can therefore provide useful information for studying the mechanism of recombination. In this study, a new computational predictor called SVM-EL is proposed to identify hotspots and coldspots across the yeast genome. It combines Support Vector Machines (SVMs) with Ensemble Learning (EL) using three feature types: basic k-mer composition (Kmer), dinucleotide-based auto-cross covariance (DACC), and pseudo dinucleotide composition (PseDNC). These features allow the predictor to incorporate both nucleic acid composition and sequence-order information. The proposed SVM-EL achieves an accuracy of 82.89% on a widely used benchmark dataset, outperforming some related methods.
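
As a rough illustration of the simplest of the three feature types, the basic k-mer composition of a DNA sequence can be sketched as below. This is a minimal sketch under stated assumptions, not the authors' implementation: the function name `kmer_composition`, the choice to normalize counts to frequencies, and the handling of ambiguous bases are all hypothetical choices made for the example.

```python
from itertools import product


def kmer_composition(seq, k=2):
    """Return normalized k-mer frequencies for a DNA sequence.

    The output vector has 4**k entries, ordered lexicographically over
    the alphabet A, C, G, T. (Hypothetical helper for illustration only.)
    """
    seq = seq.upper()
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    total = 0
    # Slide a window of length k over the sequence, one base at a time.
    for i in range(len(seq) - k + 1):
        window = seq[i:i + k]
        if window in counts:  # assumption: skip windows with ambiguous bases such as N
            counts[window] += 1
            total += 1
    if total == 0:
        return [0.0] * len(kmers)
    return [counts[kmer] / total for kmer in kmers]


# Example: dinucleotide (k=2) composition of a short toy sequence.
features = kmer_composition("ACGTACGT", k=2)
```

A vector like `features` could serve as input to one SVM in an ensemble; the DACC and PseDNC features additionally encode sequence-order information and are not covered by this sketch.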

Similar resources

Stimulation of meiotic recombination in yeast by an ARS element.

In a previous study, meiotic recombination events were monitored in the 22-kb LEU2 to CEN3 region of chromosome III of Saccharomyces cerevisiae. One region (the hotspot) was shown to have an enhanced level of both gene conversion events and reciprocal crossovers, whereas a second region (the coldspot) was shown to have a depressed level of both types of recombination events. In this study we ha...

Combining Classifier Guided by Semi-Supervision

The article suggests an algorithm for regular classifier ensemble methodology. The proposed methodology is based on possibilistic aggregation to classify samples. The argued method optimizes an objective function that combines environment recognition, multi-criteria aggregation term and a learning term. The optimization aims at learning backgrounds as solid clusters in subspaces of the high...

Opioid hedonic hotspot in nucleus accumbens shell: mu, delta, and kappa maps for enhancement of sweetness "liking" and "wanting".

A specialized cubic-millimeter hotspot in the rostrodorsal quadrant of medial shell in nucleus accumbens (NAc) of rats may mediate opioid enhancement of gustatory hedonic impact or "liking". Here, we selectively stimulated the three major subtypes of opioid receptors via agonist microinjections [mu (DAMGO), delta (DPDPE), or kappa (U50488H)] and constructed anatomical maps for functional locali...

Tethering recombination initiation proteins in Saccharomyces cerevisiae promotes double strand break formation.

Meiotic recombination in Saccharomyces cerevisiae is initiated by the creation of DNA double strand breaks (DSBs), an event requiring 10 recombination initiation proteins. Published data indicate that these 10 proteins form three main interaction subgroups [(Spo11-Rec102-Rec104-Ski8), (Rec114-Rec107-Mei4), and (Mre11-Rad50-Xrs2)], but certain components from each subgroup may also interact. Alt...

Journal title:

Volume 2016, Issue

Pages -

Publication date: 2016